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ACOUSTIC ECHO AND NOISE CANCELLATION 

BACKGROUND OF THE INVENTION 
Field Of The Invention 

[0001] The invention relates to a filter comprising at least two inputs for receiving at least 

two signals, and an output for supplying an output signal. 
5 [0002] The invention fiirther relates to a signal processing device comprising such a filter. 

[0003] The invention further relates to a teleconferencing system, a voice-controlled 

electronic device, and a noise cancellation system, 

[0004] The invention fiirther relates to a method for filtering at least two signals and for 

supplying an output signal. 
10 [0005] Recent developments in audio and video systems require the use of multiple channel 

processing and reproduction with acoustic echo cancellers (AEC) and noise cancellers. For example 
in mini-group video conferencing systems, multiple channel transmission leads to a better 
"localization" of the diverse people in the room. This enhances the intelligibility and naturalness of 
the speech. 

15 Further, multiple channel echo cancellation is needed in voice-controlled stereo audio and video 
equipment, such as television receivers, radio receivers, CD players, etc. 

[0006] A multiple channel AEC can, in general, not be created by a simple combination of 

multiple single chaimel AEC's. 

2 0 Description Of The Related Art 

[0007] U.S. Patent 5,828,756 discloses a stereophonic commxmication system, such as a 

teleconferencing system, which involves selectively reducing the correlation between the individual 
channel signals of the stereophonic system. Herein, non-linearities are added to the input signals to 
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reduce the correlation. However by adding these non-linearities, audible artifacts in the output 
signals are introduced. These non-linearities can (sometimes) be accepted in teleconferencing 
systems but are certainly not acceptable in other applications, such as supplying music, etc. 

SUMMARY OF THE INVENTION 
[0008] It is, inter alia, an object of the invention to provide a filter, which overcomes the 

objections of the prior art. To this end a first aspect of the invention provides a filter described in the 
opening paragraph, characterized in that the coefBcient updates are determined in a transformed 
domain, and the filter comprises means for reducing the effect of correlation between the i9nput 
signals on the coefficient updates. 

[0009] Herewith, the performance of the adaptive filters is improved without a huge increase 

in computational complexity. 

BRIEF DESCRIPTION OF THE DRAWINGS 
15 [0010] The invention and additional features, which may be optionally used to implement the 

invention to advantage, will be apparent from and elucidated with references to the examples 
described below, hereinafter and shown in the accompanying drawings, in which: 
[0011] Figure 1 schematically shows the multiple input adaptive FIR filter, according to the 

invention; 

2 0 [0012] Figure 2 schematically shows the calculation of the output of the FIR filter, according 

to the invention; 

[0013] Figure 3 schematically shows the calculation of Y in the Multiple Input Partitioned 

Frequency Domain adaptive filter, according to the invention for the case with direct inverse power 
estimation; 
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[0014] Figure 4 schematically shows the calculation of the coefficient vectors w. according 

to the invention; 

[0015] Figure 5 shows a schematic example of a stereo echo cancellation in a 

teleconferencing system according to the invention, 
5 [0016] Figure 6 shows a more detailed schematic example of a teleconferencing system 

according to the invention; 

[0017] Figure 7 shows a schematic example of a voice controlled device according to the 

invention; 

[0018] Figure 8 shows a schematic example of a noise canceller according to the invention; 

10 and 

[0019] Figure 9 schematically shows the determining of the input channel's power matrix. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
[0020] In the description, the equations, matrices, etc., are shown as described below. Signals 

15 are denoted by lower case characters, constants by upper case. Underlining is used for vectors, lower 
case for time domain, and upper case for frequency domain. Matrices are denoted by bold face upper 
case, e.g., I. The dimension is put in superscript (e.g., IheBxQ matrix X is given by X^^Q, for a 
square matrix the second dimension is omitted). Diagonal matrices are denoted by a double 
underline, e.g., P, with its diagonal denoted as A subscript /, e.g., w/, denotes the z'th 

20 version. The A:'th element of w is given by (w)^. Finally, appending [k] denotes the time index, (.)^ 
denotes the transpose, (.)* denotes the complex conjugate, and (.)^ denotes the Hermitain transpose 
(complex conjugate transpose). 

[0021] A general multiple input adaptive FIR filter, depicted in Figure 1, uses the S signals 

Xq [k] to x^_i [k] to remove imwanted components correlated with these signals in the signal e[k], 
25 The signals Xq [k] to ;c^_i [k] are input to S FIR-filters Wq to W^_^ , with outputs [k] to [k]. The 
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goal of the update algorithm is to adapt the coefficients of the PER filters in such a way that the 

correlation between r[k] and the input signals Xq [k] and x^.j [k] is removed. 

[0022] For 5 > a > 0 , the FIR filter Wa performs the convolution of the signal Xa[k] and the 

coefficients Wa,o[k] . . . Wa,N-i[k] of that filter. The output signal ea[k] of such a filter can be described 

as follows 



/=0 



(1) 



10 



With,for 5>fl>0, 



Xa[k] J 



(2) 



(3) 



15 [0023] 



The output of the multiple input adaptive filter is given by 



a=0 



(4) 



[0024] These filter parts of the separate (adaptive) filters Wo until Ws-i can be implemented 

2 0 efficiently in fi-equency domain with help of partitioning, block processing and Discrete Fourier 
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Transforms (DFTs). A reduction in computational complexity is obtained since the convolutions per 
sample in the time domain transform to element-wise multiplications per block in the frequency 
domain. We vise block processing with block length B and DFTs of length M, with M >N + B-\. 
The transformation of the input signals can be described for 5 > a > 0 by 



X"[kB]=F*' 



^x^[kB-M + \l 



(5) 



10 



where F** is the M x M Fourier matrix. The (a,b)'1h element (for 0 < a < M,0 < ft < M )of the Fourier 
matrix is given by 



- jlmb 



M 

where j = 4-i. 



(6) 



15 



[0025] 



The filter can then be computed in the frequency domain, by 



a=0 



^e[kB-B + \]\ 
e[kB] 



(7) 



Note that the frequency domain filter coefficients are related to the time domain coefficients, for all 
5 > a > 0 this can be denoted by 



M 
M 



£^[/]=F^ 



(8) 



V - / 
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[0026] To obtain an efficient implementation, the block length B must be chosen in the same 

order as the filter length which results a large processing delay. 

[0027] To reduce the processing delay, the filter can be partitioned into smaller pieces of 

length B, and with g = lN/ B\v/e get the implementation of Figure 2, that can be described by 

e[kB]=(o e/p-O*" N (9) 

a=0 1=1 a 



with£"[/]=F" 



0 



w 



[0028] For the update part of the filter, one can use S separate update algorithms to improve 

1 0 convergence behavior the input signals can be decorrelated separately in the time domain by using 
RLS like algorithms, leading to a huge computational complexity. Complexity reduction can be 
obtained by implementation in the frequency domain with (partitioned) Block Frequency Domain 
Adaptive Filters, as described in G.P.M. Egelmeers, Real time realization of large adaptive filters, 
Ph.D. thesis, Eindhoven University of Technology, Eindhoven (The Netherlands), Nov. 1995. When 
1 5 there is correlation between the input signals of the filters, this might still lead to very bad 
convergence behavior, due to the non-uniqueness problem. 

[0029] In this application, it is proposed to use a partitioned algorithm in the frequency 

domain that reduces the effect of the cross-correlation between the input signals on the algorithm's 
convergence behavior. To reduce complexity, block processing, with block length A to compute the 
2 0 sum of A consecutive updates with each iteration, is used. For 5 > > 0 , the coefficient vectors 
[IA] are partitioned into gu parts of length Z with gu = / ^1 such that, for S> J>0 
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with for 5 > 7 > 0 and g„ > z > 0 



A Fourier transform length L is used with L>Z + A-\,'wq define the input signal Fourier 
transforms for 5 > a > 0 as 



10 



The diagonal matrices \IA\ contain the vector [/^] as main diagonal, so for 5 > a > 0 



15 
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An overlap-save method is used to compute the correlation involved in the adaptation process in the 
frequency domain, the frequency domain transform of the residual signal vector equals 



The set of update equations for the filter coefficients in the MFDAF (Multiple Input Frequency 
Domain Adaptive Filter) algorithm can now be defined for g„ > / > 0 by 



'wL.,[(/+i)^r 
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Z,S.L 



{vl[lA-iZ]) 



R'[IA] 



15 



with 



= 2a(p"[M])"'. 
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and the transformation matrix C^ ^^^is given by 



5 G 



S.ZS.L _ 





... O^''^ 















with 



10 



The input chaimers power matrix P*^^ [IA] is defined by 



[14] = 1 fx" " [U^ .(X" ' [U'^f } 



15 



where 
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x"-^[/^]= 



V=0L J. 



The expectation operator e{} of the above equation has to be replaced by an estimation routine. 

5 

[0030 J The power matrix P \IA\ can be estimated by 

[/^]=(i-;.)p^-^[(/-i)^]-f ^( x"'^ [lA] y .(X (10) 

L 

[0031] To reduce the number of multipUcations, the step-size parameter a of equation is 

incorporated in the above power estimation routine by defining 

10 Pf^[L4] = — P"M (11) 
2a 

so 

2a(p^[/4-'=(pf M)"'. (12) 
Estimation of the power matrix P^^ [IA] can then be done by 

15 

^!'liA]=(i-r)-K'[(i-i)Ah^i x--M/^] r •( x"-M/^] )'• (13) 

Direct application of this algorithm leads to stability problems. When the input signal power in a 
certain frequency bin is very small, the power in that bin will decrease to a (very) small value. The 
inverse of the matrix will then have large values and will be inaccurate (due to numerical and 
2 0 estimation errors) In the ideal case, the eigenvalues of the power matrix estimate cancel the 
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eigenvalues of the input signal power matrix. Due to estimation errors, this goal is only 
approximated, and the mismatch introduces a deviation from the ideal convergence behavior and 
might even lead to instability. Especially when some of the eigenvalues of the estimate of the inverse 
power matrix get large, and do not (exactly) cancel a (small) eigenvalue of the input signal power 
5 matrix, instability might occur. A lower limit to the eigenvalues of the estimate of the power matrix 
can solve this problem. In the single channel case (or when we forget the cross-terms), we can solve 
this problem by applying a lower limit to the lower values. We can do this because the eigenvalues 
of a diagonal matrix are equal to the elements of the diagonal, so we actually limit the eigenvalues. 
In the multiple channel case we also have to limit the eigenvalues to assure stability, but these no 

1 0 longer equal the elements on the diagonal. 

[0032] We know, however, that all eigenvalues of the power matrix are positive. We can now 

create a lower limit on the eigenvalues by shifting them by the suggested minimum. We know that 
for all eigenvalues Xofa matrix A, the determinant of A+(P^i^ - A') -I must be zero. So for all Ji\ 
eigenvalue of A+P^-^-l, there must be a X, eigenvalue of A, such that /l'=A.+Pmm (and the other 

1 5 way around). This means that by adding a constant Pmin to the main diagonal of a matrix, all 
eigenvalues of that matrix are shifted by Pmin-)- So we define: 



2a 



2 0 which results in 



This is shown in Figure 9 of the drawings. 

[0033] The effect of this shifting of the eigenvalues on the (theoretical ideal) convergence 

2 5 behavior of the algorithm will be very small, and in practice the algorithm is much more stable. 
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[0034] Although V^f^ [IA] is a sparse matrix, computing its inverse would still require L 

inversions of SxS matrices, which takes on the order of L-S^ operations. As we, however, do not 
need the inverse itself, but only its matrix-vector product with the input signals, we can also look at it 
as solving the system 



which requires on the order of L-S^ operations. Another option is to estimate the inverse of 
P^^ [IA] directly, which also results in a number of operations proportional to LS^ , However, also 
in this case, we have to limit the eigenvalues to assure stability. 
[0034] A simple algorithm is given by: 

with 
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Q''-'[u] = (l>"-ll-\Af\ 



[lAj 
MM) 



=(p^^[/-u])"'.(x"M^])* 



(14) 



5 [0035] 



We can incorporate a, which results in 



10 



15 



[0036] The above algorithm does not guarantee a matrix (p^*^^ [m]) ' with positive 

eigenvalues, and, therefore, introduces a lot of stability problems. In the single channel case, we are 
able to stabilize the algorithm by using a lower limit on the estimate, which automatically results in 
positive eigenvalues because the matrix is diagonal, but that is not possible in the multiple channel 
case. 

Positive eigenvalues 



[0037] An exact transformation of an algorithm for estimating P^^ [IA] with positive 

eigenvalues will lead to an estimation algorithm for the inverse with positive eigenvalues. This can 
20 be done by using the matrix inversion lenmia. When there is matrix A such that 

A=B + C D E (15) 
then the inverse matrix (A)'^ of A can be expressed by 

(A)'' =(B)-' -(B)-' 'C'((D)-' +E'(B)-' -C)-' -E.^;"* • (16) 
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(17) 



By choosing 

B = [(/-l)^] 
C = (X''''[lA]y 

D = 

2aL 

5 and 

Q = (B) ' c 

= E.(B)-' (18) 

= 7^(p"[(/-i)4"'-(x^''M)' 

i-r 

we obtain by using Equation (14) 



10 (K'llAy =-^¥^'[(l-l)Ay -Q''''-[lA]-(D'-[lAy -(Q^^-^M)* (19) 

i-r 



with 



B''[lA]=—l' +iX''''-[lA]y -Q'-'-'llA]- (20) 
- / 

15 

[0038] Equation (19) involves no matrix inversion, and only Z, / 2 + 1 divisions, as the matrix 

[IA] of Equation (20) is a real valued diagonal matrix. 
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Limits on the eigenvalues 



[0039] An operation on the inverse power matrix that is equivalent to adding a constant to the 

5 diagonal of the (non-inverse) power matrix, would solve the problem. Adding a full (5 • Z) x (S • L) 
identity matrix and trying to find an equivalent operation on the inverse power matrix with the 
matrix inversion lenmia, results in an algorithm that requires the matrix inversion we would like to 
avoid, so we try 

p:LN=(i-rtP:t[('-i)4+^(xL"M)-(xL"M)' (21) 

10 

with 



{^^''W -(xL^'lM])' =(x^-^'^M)* .(x^-^-^[M]y -loLi'' ■ ill) 

1 5 [0040] As the matrix I has rank 5 • I and the product matrices (X^^^ [IA])* • (Xf^^'^ [Z^])' 

and (X^ '"^ [lA\y • (X^ '"^ [IA]) ' both have a rank of (at most) L, this is not possible for S>1 . As we 
need the average of (X^"^'^ [IA])* • (X^ '-'- [u])' , we can find a solution by taking the average over S 
consecutive updates. We will try to find X^'^ [IA] such that 



\s-\ 



20 



[a=0 



(23) 



= ^ X (X^-^-^ [(/ + a) A])' . (X'''' [(/ + a)A]y \ + P^^2ccLS- 1'' 



la=Q 



with for i = / mod 5 
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XL"[t4]= 



5,'M 



+ 



(24) 



with 



5 = 



^0,1 "o,o y 



For = 1 , we get 



(25) 



(26) 



[0041] For 5 > 1 , there are an infinite number of solutions. If we try to keep the maximum 

10 distortion (the largest matrix element) as small as possible, we have to choose, for all 5 > 7 > 0 and 
5 > / > 0 

Uj,=±ylP^-2ocL. (27) 



15 



If there is a real symmetric matrix for S=L, then a real symmetric matrix U^^ for S = 2L is given 
by 



(28) 
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[0042] Using the above equation, we can construct all with / > 0 . If S -hi is not a power 

of two, then we will use the matrix U^' where 2' > 5 + 1 > 2'"\ and use the last S rows. In Table 1, 
the power matrix estimation algorithm using a direct inverse estimation with limits on the 
5 eigenvalues is sunraiarized. 



PHNL000172-SS-101005 



17 



PHNL 000172 

Initialization 

1. S =2'"*^^2'^"^^1 



for i = \to log2 (5„ )do 
2. begin 

u 



3. Initialize power matrix. 



Iteration 



10 



1. xL^'^M= 



fx' Ml 

= 5-1 J 


f 




+ 




V 



2. Q^'M/^]=7^ te[(/-i)4" -(xL'-n/^])* 

l-y 

- r 

4. Ca/cM/a^e(D^M)"' 

(Pa';L[/4-'=7^(p:L[(/-i)^])-' 

5. 1-r 

Table 1 : Direct inverse power update with limits. 



15 
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[0043] Note that the inverse of P*^^ [^4] is also a sparse matrix with the same structure, and 

we define 

5 where for 0 < / < 5 and 0 < 7 < 5 
T;;.M = ^/agfe[/4. 

[0044] Figure 5 shows, schematically, an example of a teleconferencing system TS5 using a 

10 stereo echo canceller SEC5 with adaptive filters AF5 (only one shown). The teleconferencing system 
comprises a far room FR5 and a near room NR5. The adaptive filter AF5 has to filter the stereo echo 
signals. 

[0045] Figure 6 shows an example of a stereo echo canceller SEC6 used in a teleconferencing 

system TS6. The stereo echo canceling has to be performed between the near room NR6 and the far 

15 room FR6. In this example, also programmable filters PF61 and PF62 are used to improve the 
performance of echo canceling. Programmable filters are described in U.S. Patent 4,903,247. 
[0046] Further, also the output of the progranmiable filters is supplied to a dynamic echo 

suppressor DES6, which is coupled with an output to the output of the stereo echo canceller. 
Dynamic echo suppressors are described in International Patent Application No. WO-97-45995, 

20 corresponding to U.S. Patent 6,546,099. 

[0047] A fiiU stereo communication requires four stereo AECs, two on the near end side and 

two on the far end side. In Figure 6, only one of these echo cancellers is depicted. Note that on each 
side we can combine the input signal delay-lines, the FFTs and the multiplication by the inverse 
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power matrix of the two echo cancellers, which implies that the relative extra computational 
complexity for removing the cross-correlation is even further reduced. The performance of the AECs 
is further improved by adding Dynamic Echo Suppressors as shown. 

[0048] Figure 7 shows another application wherein a stereo echo canceller SEC7 is used in a 

5 voice-controlled audio (and video) system VCS7. To be able to recognize the local speaker by a 
voice recognition engine, we have to cancel the soxmd emitted by the audio set through the 
loudspeakers. This is done by using the stereo echo canceller SEC7. To improve the stereo echo 
canceling, also, in this example, the programmable filters PF71 and PF72 are used, and the Dynamic 
Echo Suppresser DES7 is used. The output of the Dynamic Echo Suppressor is coupled to a voice 

1 0 recognizer VR7 for handling the filtered signal. 

[0049] Figure 8 shows an example of a noise canceller NC8 for canceling the noise received 

on microphones in a room R8 together with a speech signal spl from a person in the room. In this 
example, the microphones supply signals to a beam former BF8 which beam former supplies signals 
to the noise canceller NC8 and to programmable filters PF81, PF82 and PF83. Further the noise 

15 canceller comprises a dynamic echo suppressor DES8. The output of the Dynamic Echo Suppressor 
is coupled to the output of the noise suppressor to supply an estimate of the received speech sp2. 
[0050] Also in the multiple input noise canceller, we can apply a DES (which is, in fact, not 

suppressing an echo, but is similar to the DES in the AECs case) and programmable filters to 
improve performance, as shown in Figure 8. An extra problem is that the inputs of the filters may 

2 0 contain some elements of the desired signal ("signal leakage"), because the beam former is not 
perfect. When the desired signal is speech signal, a speech detector can be used to improve the 
behavior of the MFDAF. 

[0051] Above some examples of application of a stereo echo canceller and of a noise 

canceller are described. It is to be noted that the invention can be used in different applications and is 
2 5 not restricted to the described applications. 
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